
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos



Abstract

In this work, we propose an approach to the spatiotemporal localisation (detection) and classification of multiple concurrent actions within temporally untrimmed videos. Our framework is composed of three stages. In stage 1, appearance and motion detection networks are employed to localise and score actions from colour images and optical flow. In stage 2, the appearance network detections are boosted by combining them with the motion detection scores, in proportion to their respective spatial overlap. In stage 3, sequences of detection boxes most likely to be associated with a single action instance, called action tubes, are constructed by solving two energy maximisation problems via dynamic programming. While in the first pass, action paths spanning the whole video are built by linking detection boxes over time using their class-specific scores and their spatial overlap, in the second pass, temporal trimming is performed by ensuring label consistency for all constituting detection boxes. We demonstrate the performance of our algorithm on the challenging UCF101, J-HMDB-21 and LIRIS-HARL datasets, achieving new state-of-the-art results across the board and significantly increasing detection speed at test time. We achieve a huge leap forward in action detection performance and report a 20% and 11% gain in mAP (mean average precision) on UCF-101 and J-HMDB-21 datasets respectively when compared to the state-of-the-art.
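The stage-2 fusion can be illustrated with a short Python sketch. The abstract only states that appearance detections are boosted by the motion detection scores in proportion to their spatial overlap, so the IoU-weighted additive rule below, and the names `iou`, `boost_appearance_scores`, `app_dets` and `flow_dets`, are illustrative assumptions rather than the paper's exact formulation.

```python
def iou(box_a, box_b):
    """Intersection-over-union of two [x1, y1, x2, y2] boxes."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter + 1e-8)


def boost_appearance_scores(app_dets, flow_dets):
    """Boost each appearance detection (box, score) by the optical-flow
    detection scores, weighted by spatial overlap -- an assumed fusion rule."""
    boosted = []
    for a_box, a_score in app_dets:
        bonus = sum(iou(a_box, f_box) * f_score for f_box, f_score in flow_dets)
        boosted.append((a_box, a_score + bonus))
    return boosted
```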
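The first-pass energy maximisation of stage 3 links per-frame detections into video-long action paths using their class-specific scores and spatial overlap. The Viterbi-style dynamic programme below is a minimal sketch of that idea under assumed notation (`link_action_paths`, the overlap weight `lam`), reusing the `iou` helper above; the second pass, which temporally trims each path by enforcing label consistency, would be a further dynamic programme over the resulting path and is not shown.

```python
import numpy as np

def link_action_paths(frame_dets, lam=1.0):
    """frame_dets[t]: list of (box, class_score) pairs for one action class at
    frame t. Returns per-frame detection indices of the highest-scoring path,
    where the path energy sums detection scores plus lam * IoU between
    consecutive boxes (a sketch, not the paper's exact energy)."""
    T = len(frame_dets)
    best = [np.array([s for _, s in frame_dets[0]], dtype=float)]  # accumulated energies
    back = []                                                      # backpointers per frame
    for t in range(1, T):
        boxes_prev = [b for b, _ in frame_dets[t - 1]]
        prev_best = best[-1]
        cur_scores, cur_back = [], []
        for box, score in frame_dets[t]:
            # pairwise term: overlap with every candidate box of the previous frame
            trans = prev_best + lam * np.array([iou(box, pb) for pb in boxes_prev])
            j = int(np.argmax(trans))
            cur_scores.append(score + trans[j])  # unary term: class-specific score
            cur_back.append(j)
        best.append(np.array(cur_scores))
        back.append(cur_back)
    # backtrack from the best final detection to recover the full path
    path = [int(np.argmax(best[-1]))]
    for t in range(T - 1, 0, -1):
        path.append(back[t - 1][path[-1]])
    return list(reversed(path))
```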
